Hierarchical Clustering for Semi-Supervised Ground Truth Generation

نویسندگان

  • Medha Baidya
  • Samuel Maddock
  • Sazzadur Rahaman
  • Jason Ziglar
چکیده

Supervised learning tasks can require a large collection of labeled data for accurate pattern recognition. For recognition of handwritten characters, manually producing ground truths can be very tedious. In this paper, we propose a semisupervised hierarchical clustering method to reduce the necessary amount of human effort required for labeling a dataset of handwritten characters. The experimental results demonstrate that the approach can improve labeling accuracy over baseline methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comprehensive Cross-Hierarchy Cluster Agreement Evaluation

Hierarchical clustering represents a family of widely used clustering approaches that can organize objects into a hierarchy based on the similarity in objects’ feature values. One significant obstacle facing hierarchical clustering research today is the lack of general and robust evaluation methods. Existing works rely on a range of evaluation techniques including both internal (no ground-truth...

متن کامل

Active semi-supervised fuzzy clustering

Clustering algorithms are increasingly employed for the categorization of image databases, in order to provide users with database overviews and make their access more effective. By including information provided by the user, the categorization process can produce results that come closer to user’s expectations. To make such a semi-supervised categorization approach acceptable for the user, thi...

متن کامل

A Semi-Supervised Clustering Method Based on Graph Contraction and Spectral Graph Theory

Semi-supervised learning is a machine learning framework where learning from data is conducted by utilizing a small amount of labeled data as well as a large amount of unlabeled data (Chapelle et al., 2006). It has been intensively studied in data mining and machine learning communities recently. One of the reasons is that, it can alleviate the time-consuming effort to collect “ground truth” la...

متن کامل

Semi-supervised Fuzzy Clustering Algorithms for Change Detection in Remote Sensing Images

For the problem of change detection it is difficult to have sufficient amount of ground truth information that is needed in supervised learning. On the contrary it is easy to identify a few labeled patterns by the experts. In this situation to avoid wastage of available information semi-supervision is suggestible to enhance the performance of unsupervised ones. Here we present the fuzzy cluster...

متن کامل

SSHMT: Semi-supervised Hierarchical Merge Tree for Electron Microscopy Image Segmentation

Region-based methods have proven necessary for improving segmentation accuracy of neuronal structures in electron microscopy (EM) images. Most region-based segmentation methods use a scoring function to determine region merging. Such functions are usually learned with supervised algorithms that demand considerable ground truth data, which are costly to collect. We propose a semi-supervised appr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015